19:08
2026-06-28
lesswrong.com
ai-safety
Anthropomorphic misalignment research needs stronger evidence
Researchers at ETH Zurich argue in a new ICML 2026 position paper that AI safety studies on anthropomorphic behaviors like deception and scheming lack rigorous evidence, risking misallocated resources…